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Model- based  object  recognition  commonly  involves  using  a  minimal  sejl-of-matched  model  and 
image  points  to  compute  the  pose  of  the  model  in  image  coordinates.  Furthermore,  recognition 
systems  often  rely  on  the  “weak  perspective”  imaging  model  in  place  of  the  perspective  imaging 
model.  This  paper  discusses  computing  the  pose  of  a  model  from  three  corresponding  points 
under  weak-perspective  projection.  A  new  solution  to  the  problem  is  proposed  which,  like 
previous  solutions,  involves  solving  a  biquadratic  equation.  Here  the  biquadratic  is  motivated 
geometrically  and  its  solutions,  comprised  of  an  actual  and  a  false  solution,  are  interpreted 
graphically.  The  final  equations  take  a  new  form,  which  lead  to  a  simple  expression  for  the 
image  position  of  any  unmatched  model  point. 
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1  Introduction 


Recognizing  an  object  generally  requires  finding  correspondences  between  features  of  a  model 
and  an  image.  Since  finding  corresponding  features  often  requires  trying  all  possible  correspon¬ 
dences,  recognition  systems  frequently  use  correspondences  between  miniTriAl  sets  of  features 
to  compute  poses  of  the  model.  For  instance,  "alignment”  techniques  repeatedly  hypothesize 
correspondences  between  minimal  sets  of  model  and  image  features,  and  then  use  those  corre¬ 
spondences  to  conq)ute  model  poses,  which  are  used  to  find  other  model-image  correspondences 
(e.g.,  [5],  [10],  [1],  [9],  [28],  [29],  [15],  [3],  [16]-[18],  [30],  [19]).  In  addition,  “pose  clustering” 
techniques  use  every  correspondence  between  a  minimal  set  of  model  and  unage  features  to 
compute  a  model  pose,  and  then  count  the  number  of  times  each  pose  is  repeated  (e.g.,  [2], 
[26],  [25],  [23],  [11],  [4]). 

For  computing  poses  of  3D  objects  from  2D  images,  a  model  of  projection  must  be  selected, 
and  typically  either  perspective  or  “weak-perspective”  projection  is  chosen.  Weak-perspective 
projection  is  an  orthographic  projection  plus  a  scaling,  which  serves  to  approximate  perspective 
projection  by  assuming  that  all  points  on  a  3D  object  are  at  roughly  the  same  distance  from  the 
camera.  For  both  perspective  and  weak-perspective  projections,  the  minimal  number  of  points 
needed  to  compute  a  model  pose  up  to  a  finite  number  of  solutions  is  three  ([10],  [18]).  For 
point  features,  then,  the  problem  is  to  determine  the  pose  of  three  points  in  space  given  three 
corresponding  image  points.  When  perspective  projection  is  the  imaging  model,  the  problem 
is  known  as  the  “perspective  three-point  problem”  [10].  When  weak-perspective  is  used,  I  shall 
call  the  problem  the  “weak-perspective  three-point  problem.” 

A  few  methods  for  solving  the  weak-perspective  three-point  problem  have  been  suggested  in 
the  past  ([20],  [8],  [17],  [18],  [12]),  and  this  paper  proposes  anew  method  (solution).  The  major 
differences  with  the  new  solution  is  that  it  motivates  and  explains  the  solution  geometrically, 
and  it  does  not  compute  a  model-to-image  transformation  as  an  intermediate  step.  As  will  be 
demonstrated  later,  understanding  the  geometry  is  useful  for  seeing  under  which  circumstances 
the  solution  simplifies  or  breaks,  and  for  analyzing  where  the  solution  is  stable.  Furthermore, 
a  geometric  imderstanding  may  be  useful  for  seeing  how  the  solution  is  affected  by  error  in  the 
image  and  the  modeL 

In  addition  to  providing  a  geometric  interpretation,  the  solution  in  this  paper  gives  direct 
expressions  for  the  three  matched  model  points  in  image  coordinates,  as  well  as  an  expression 
for  the  position  in  the  image  of  any  additional,  unmatched  model  point.  Earlier  methods  all 
require  the  intermediate  computation  of  a  model-to-image  transformation.  This  is  meaxiingfiil 
because,  as  mentioned  above,  many  alignment-based  recognition  systems  calculate  the  3D  pose 
solution  many  times  while  searching  for  the  correct  pose  of  the  model.  Consequently,  avoiding 
the  intermediate  calculation  of  the  transformation  could  cause  such  systems  to  run  faster. 

To  illustrate  how  significant  such  a  speed-up  can  be,  consider  a  system  that  performs  3D 
recognition  by  alignment  using  point  features  to  generate  h]rpotheses.  The  input  to  the  system 
is  a  model  and  an  image,  and  the  goal  is  to  identify  all  instances  of  the  model  in  the  image. 
The  model  is  specified  by  a  set  of  3D  points  thed  can  be  detected  reliably  in  images,  along  with 
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any  numb«  of  extended  fezdutes'  whdse  projections  can  be  predicted  using  points  (e.g.,  line 
segments,  some  sets  of  curves,  and  ed^s  represented  point-by-point).  From  the  image,  a  set  of 
2D  points  is  extracted  by  a  low-level  process  that  looks  for  points  of  the  type  corresponding  to 
points  in  the  model.  The  alignment  algorithm  proceeds  as  follows: 

1.  Hypothesize  a  correspondence  between  three  model  points  and  three  image  points. 

2.  Compute  the  3D  pose  of  the  model  from  the  three-point  correspondence. 

3.  Predict  the  image  positions  of  the  renuuning  model  points  and  extended  features  using 
the  3D  pose. 

4.  Verify  whether  the  hypothesis  is  correct  by  looking  in  the  image  near  the  predicted  posi¬ 
tions  of  the  model  features  for  corresponding  image  features. 

This  process  is  repeated  until  aU  pairs  of  triples  of  model  and  image  (sensed)  points  have  been 
tried.  For  m  model  points  and  s  sensed  points,  there  are  (7)  (^3!  distinct  pirns  of  model  and 
image  point  triples.  Consequently,  the  running  time  for  the  algorithm  grows  with  the  cubes 
of  the  numbers  of  model  and  image  points.  Since  these  numbers  can  be  large,  the  model  and 
image  points  t]rpically  are  grouped  in  advance  so  that  only  triples  of  points  from  the  groups 
have  to  be  tried  (e.g.,  [23],  [21],  [18]).  This  can  bring  the  number  of  pairs  of  triples  into  a 
range  where  the  algorithm  is  practical.  Then  a  “constant-times”  speed-up  in  the  innermost 
loop  of  the  algorithm,  that  is,  steps  2-4  listed  above,  could  give  a  substantial  improvement  in 
the  overall  execution  time.  As  already  suggested,  the  solution  given  in  this  paper  should  make 
steps  2  and  3  significantly  &ster. 

From  observing  previous  solutions,  the  solution  given  m  this  paper  most  resembles  UUman’s 
([28],  [17]),  in  that  both  end  up  having  to  solve  the  same  biquadratic  equation,  although  each 
derives  the  biquadratic  differently.  In  this  sense,  the  solution  given  here  is  an  extension  of 
Ullman’s,  because,  unlike  Ullman’s  solution,  it  resolves  which  of  the  two  non-equivalent  solutions 
to  the  biquadratic  is  correct.  In  addition,  this  p^er  explains  graphically  why  the  two  solutions 
arise  and  to  what  geometry  each  corresponds. 

There  is  an  intrinsic  geometry  that  rmderlies  the  perspective  three-point  problem;  it  is 
shown  in  Fig.  1.  In  the  figure,  the  three  model  points,  fho,  mi,  and  rhi,  are  being  perspectively 
projected  onto  three  image  points,  to,  ti,  and  tj,  via  lines  through  the  center  of  projection  (center 
point),  p.  The  task  is  to  recover  mo,  mi,  and  m2.  The  essential  information  is  contained  in  the 
side  lengths  and  angles  of  the  surrounding  tetrahedron. 

Similar  to  the  perspective  case,  there  is  an  intrinsic  geometry  underlying  the  weak-perspec¬ 
tive  three-point  problem,  shown  in  Fig.  2.  The  picture  shows  the  three  model  pdnts  being 
projected  orthographically  onto  the  plane  that  contiuns  mo  and  is  parallel  to  the  image  plane, 
and  then  shows  them  being  scaled  down  into  the  image.  In  addition,  the  picture  shows  the 
model  points  first  being  scaled  down  and  then  projected  onto  the  image  plane.  In  each  case, 
the  projection  is  represented  by  a  solid  with  right  angles  as  shown.  The  snudler  solid  is  a 
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Figure  1:  Model  points  nibi  *^2  undergoing  perspective  projecti<m  to  produce 

image  points  io,  ti,  and  tj.  a,  6,  and  c  are  distances  from  the  center  point,  p,  to  the 
model  points. 


scaled-down  version  of  the  larger.  The  relevant  information  consists  of  the  side  lengths  of  the 
solids  and  the  scale  factor. 

In  what  follows,  first  the  perspective  case  is  discussed  (Section  2).  Then  I  summarize 
how  to  compute  3D  pose  from  three  corresponding  points  under  weak-perspective  projection 
(Section  3).  Third  the  3D  pose  solution  is  shown  to  exist  and  be  unique,  and  a  geometrical 
interpretation  is  provided  (Section  4).  Next  a  direct  expression  is  derived  for  the  image  position 
of  an  unmatched  model  point  (Section  5).  Then  I  review  earlier  solutions  to  the  problem  and 
present  the  three  most  related  solutions  in  detail  (Sections  7  and  8).  In  addition,  the  new  and 
earlier  solutions  are  examined  and  compared  in  terms  of  their  stabilities  (Sections  6  <uid  9). 


2  The  Perspective  Solution 

To  see  the  difference  between  the  perspective  and  weak-perspective  cases,  first  let  us  observe 
exactly  what  is  required  for  the  perspective  three-point  problem.  As  pictured  in  Fig.  1, 1  will 
work  in  camera-centered  coordinates  with  the  center  point  at  the  origin  and  the  line  of  si^t 
along  the  z  axis.  The  distances  Aoi,  Ao2»  and  Ais  come  from  the  original,  untransformed  model 
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points.  The  angles  doh  ^02t  ^12  can  be  computed  from  the  positions  of  the  image  points, 

the  focal  length,  and  the  center  point.  To  see  this,  let  /  equal  the  focal  length,  and  let  the 
image  points  to,  ti,  t2  be  extended  as  follows:  (a;,p)  — >  {x,y,f).  Then 

COS^oi  =  *0  •  *l5  C08do2  =  to  •  t2»  COS012  =  ti  •  t2,  (l) 

where  in  general  v  denotes  the  unit  vector  in  the  direction  of  v.  The  problem  is  to  determine 
a,  b,  tmd  c  i^ven  iZoii  -Aosi  Rut  cos^oii  cosdo2,  and  cos0i2.  IVom  the  picture,  we  see  by  the 
law  of  cosines  that 


+  b^  -  2abcos$oi  =  Rqi 

(2) 

a*  +  c*  -  2aecosdo2  =  ^2 

(3) 

b^  +  -  25c  cos  ^12  =  Rii 

(4) 

Over  time,  there  have  been  many  solutions  to  the  problem,  all  of  which  start  with  the  above 
equations.  The  solutions  differ  in  how  they  manipulate  the  equations  when  solving  for  the 
unknowns.  Recently,  Haralick  et  al.  reviewed  the  various  solutions  and  examined  their  stabili¬ 
ties  [13]. 

Given  a,  b,  and  c,  we  easily  can  compute  the  3D  locations  of  the  model  points: 


mo  =  a«o,  m2=c»2.  (5) 

If  a  3D  rigid  transformation  is  desired,  it  can  be  determined  from  the  original  3D  model  points 
and  the  3D  camera-centered  model  points  just  computed.  A  simple  method  for  doing  so  is 
given  in  Appendix  A;  for  a  least-squares  solution,  see  Horn  [14]. 

Although  perspective  (central)  projection  is  a  more  accxirate  model,  niunerous  researchers 
have  used  weak-perspective  projection  instead  (e.g.,  [24],  [20],  [7],  [8],  [25],  [28],  [29],  [21],  [22], 
[16]-[18],  [3],  [30],  [19],  [12]).  The  justification  for  using  weak-perspective  is  that  in  many  cases 
it  approximates  perspective  closely.  In  particular,  for  many  imaging  situations  if  the  size  of  the 
model  in  depth  (distance  in  z)  is  small  compared  to  the  depth  of  the  model  centroid,  then  the 
difference  should  be  negligible  [25]. 

There  are  some  advantages  to  using  weak-perspective  instead  of  perspective.  In  particular, 
computations  involving  weak-perspective  often  are  less  conqplicated.  In  addition,  the  weak- 
perspective  math  model  is  conceptually  simpler,  since  it  uses  orthographic  instead  of  perspective 
projection.  Another  advantage  is  that  we  do  not  need  to  know  the  camera  focal  length  or  center 
point.  Furthermore,  there  are  fewer  solutions  to  deal  with — four  for  perspective  and  two  for 
weak-perspective  ([10],  [18]).  It  should  be  understood,  however,  that  finding  two  solutions 
instead  of  four  is  only  an  advantage  if  the  four  solutions  actually  collapse  to  two;  otherwise,  at 
least  two  of  the  solutions  are  missed. 

Lastly,  the  weak-perspective  imaging  model  can  be  used  without  modification  to  recognize 
scaled  versions  of  the  same  object,  since  the  built-in  scale  factor  incorporates  object  scale.  For 
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perspective  to  handle  scale,  an  additional  scale  parameter  must  be  used.  On  the  other  hand, 
weak-perspective  is  unable  distinguish  objects  that  differ  only  in  size,  since  a  smaller  scale  could 
mean  the  object  is  smaller  or  further  away.  Nonetheless,  in  cases  where  the  weak-perspective 
approximation  applies,  the  perspective  solution  may  be  unstable  in  distinguishing  different¬ 
sized  objects  ([27],  [29],  [18]).  In  these  cases,  moving  the  object  ftirther  out  in  depth,  that  is, 
past  the  point  where  perspective  and  weak-perspective  projections  are  essentially  equivalent, 
will  have  the  same  effect  in  the  image  as  uniformly  scaling  the  object  down  in  size.  Since  the 
perspective  solution  always  distinguishes  the  depth  and  size  of  the  object,  this  suggests  that 
small  variations  in  the  image  could  lead  to  very  different  interpretations  for  the  size  as  well  as 
the  depth. 

In  sum,  there  are  significant  advantages  to  using  weak-perspective  in  place  of  perspective, 
and  under  many  viewing  conditions  the  weak-perspective  approximation  is  close  to  perspective. 
As  suggested  in  the  introduction,  for  these  situations  it  would  be  useful  to  know  how  to  solve, 
using  weak-perspective  projection,  the  problems  of  recovering  the  3D  pose  of  a  model  and 
computing  the  image  position  of  a  fovirth  model  point. 


3  Computing  the  Weak- Perspective  Solution 


This  section  provides  a  summary  of  the  results  I  will  derive  in  the  next  two  sections.  Specifically, 
it  tells  how  to  compute  the  locations  of  the  three  matched  model  points  and  the  image  location 
of  any  additional,  unmatched  model  point. 


For  reference,  the  geometry  underlying  weak-perspective  projection  between  three  corre¬ 
sponding  points,  which  was  described  in  the  introduction,  is  shown  in  Fig.  2.  All  that  is 
pertinent  to  recovering  the  3D  pose  of  the  model  are  the  distances  between  the  model  and 
image  points.  Let  the  distances  between  the  model  points  be  (iZoi>  -Roz?  -Riz)!  the  corre¬ 
sponding  distances  between  the  image  points  be  (doi,do3,di2).  Then  the  parameters  of  the 
geometry  in  Fig.  2  are 


(hi,hj) 

where 


(6) 

(7) 

(8) 


a  =  (Boi  +  +  •Ri3)('Roi  “  + -Roa  ~ -Rn)  (9) 

h  =  ‘^i(~Roi  +  ^3  + -^la)  + ‘^aCRoi  ~  ■®03  +  •^la)  + ‘^i3(-®oi  + -^oa  “  ^la)  (19) 

c  =  (dbi -I- doa  +  di2)(-<ioi  +  doa  +  di2){dbi  —  dloa  +  di3)(<loi  +  doa  -  dia)  (11) 
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As  the  eqaati<nis  show,  the  solution  has  a  two-way  ambiguity  except  when  hi  and  hj  are  zero. 
The  ambiguity  corresponds  to  a  reflection  about  a  plane  parallel  to  the  image  plane.  When 
hi  =  hj  =  0,  the  model  triangle  (the  triangle  defined  by  the  three  model  points)  is  parallel  to 
the  image  triangle  (the  triangle  defined  by  the  three  image  points).  As  a  note,  a  and  c  measure 
sixteen  times  the  squares  of  the  areas  of  the  model  and  image  triangles,  respectively.  Further, 
the  solution  fails  when  the  model  triangle  degenerates  to  a  line,  in  which  case  a  =  0;  in  fact, 
this  is  the  only  instance  in  which  a  solution  may  not  exist  (for  a  discussion  of  this  case,  see 
Section  4.5).  Note,  however,  that  no  such  restriction  is  placed  on  the  image  triangle;  so  the 
image  points  may  be  coUinear.  Even  so,  care  should  be  taken  since  the  solution  may  be  unstable 
when  image  points  are  collinear,  when  the  model  points  are  coUinear,  or  when  one  of  the  sides 
of  the  model  triangle  is  parallel  to  the  image  plane  (see  Section  6). 

Next,  I  give  an  expression  for  the  image  location  of  a  fourth  model  point.  OriginaUy,  the 
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models  points  are  in  some  arbitrary  model  coordinate  frame.  Also,  the  image  points  are  in 
a  camera-centered  coordinate  frame  in  which  the  image  serves  as  the  x-y  plane.  Denote  the 
original,  xmtransformed  model  points  by  to  distinguish  them  from  the  camera-centered  model 
points  fhi  shown  in  Fig.  2.  Using  and  pi,  solve  the  following  vector  equation  for  the 

“extended  affine  coordinates,”  (q,/3,7),  of 

Pi  =  Q(ft  -  a)  +  /3(ft  -  ^)  +  7(ft  -  A)  X  (ft  -  ^)  -I-  ft  (13) 

Given  image  points  t'o  =  (*ojyo)>  *i  =  (*i>yi)>  and  tj  =  {x2,yi),  let 

*01  =  *1  -  *o>  yoi  =  y\-  »0i 

*02  =  *2  -  *o>  yo2  =  Vi  -  Vo- 

Then  the  image  location  of  the  transformed  and  projected  ft  is 

(a*oi  -f-  ^*02  +  7(yoi-ff2  -  yo2^i)  +  *o»  otyoi  +  fiVtn  +  7(~*oi-ff2  +  *0iBx)  -I-  ft)*  (14) 

Lastly,  the  weak-perspective  solution  can  be  used  to  compute  the  3D  locations  of  the  model 
points  in  camera-centered  coordinates: 


tUq  = 

i(*o,yoi  w) 

S 

(15) 

mi  = 

-(*i,yi,hi  -ftr) 
s 

(16) 

mj  = 

“(*211/25^2  +  ®)> 

(17) 

where  w  is  an  unknown  offset  in  a  direction  normal  to  the  image  plane.  It  is  worth  noting  that 
if  the  3D  ri^d  transform  that  brings  the  model  into  camerarcentered  coordinates  is  desired,  it 
can  be  computed  from  these  three  camera-centered  model  points  and  the  original  three  model 
points.  The  unknown  offset  w  drops  out  when  conqiuting  the  rotation  and  remains  only  in  the 
z  coordinate  of  the  translation,  which  cannot  be  recovered.  As  mentioned  in  Section  2,  a  simple 
method  for  computing  the  transform  is  given  in  Appendix  A,  and  a  least-squares  solution  was 
given  by  Horn  [14]. 

4  Existence  and  Uniqueness  of  the  3D  Pose  Solution 

In  deriving  the  3D  pose  solution,  I  start  with  the  basic  geometry  for  the  weak-perspective  three- 
point  problem,  shown  in  Fig.  2.  Fig.  3  shows  the  smaller  solid  again  with  more  labels.  There 
are  three  right  triangles  in  the  solid,  from  which  three  constraints  can  be  generated: 


hi  +«loi 

= 

(18) 

hi  "I" 

=  (*i2oa)" 

(19) 

(hi  -  h2)^  +  <^2 

= 

(20) 
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Figure  3:  Smaller  solid  representing  orthographic  projection  plus  scale  of  three  model 
points  into  an  image. 

It  should  be  pointed  out  that  the  distances  i2oi,  R02,  R12,  doi,  dos,  di3  and  the  scale  factor  s 
are  all  positive,  but  the  altitudes  hi,  hj  along  with  Hi,  H2  are  signed.  Since  hi  and  ^2  are 
signed,  having  “hi  -  h^”  in  the  third  equation  is  an  arbitrary  choice  over  “hi  +  h2”;  it  was 
chosen  because,  when  hi  and  are  positive,  it  directly  corresponds  to  the  picture  in  Fig.  3. 

Multiplying  the  third  equation  by  -1  and  adding  all  three  gives 

2hih2  =  +  -Sfij  ~  -^la)  ~  (^1  *^3  ~  ^la)* 

Squaring  and  using  the  first  two  equations  again  to  eliminate  hf  and  h^,  we  have 

4(^*^i  “  doi)(s^i?03  —  djj)  =  ^s*(iioi  +  Rqi  —  ^12)  ~  (doi  +  dJa  “  ‘^la))  »  (21) 

which  leads  to  a  biquadratic  in  s  (for  details  see  Appendix  B): 

as*  -  26s*  +  c  =  0,  (22) 

where 

a  =  4il2i^3  -  («oi  +  iioa  -  Aia)* 

=  (-Roi  +  Ro7  +  ili3)(~-Roi  +  -Roa  +  iiia)(Roi  —  Roa  +  •Ri3)(-Roi  +  Roa  —  Ria) 

6  =  2Rojdg2  +  2Ro2doi  —  (Rqi  +  Roa  ~  Ri3)(‘^i  ~  ^12) 

=  ‘^i(~Roi  +  Roa  +  Ria)  +  *^a(Roa  ~  Roa  +  Ria)  +  ^^laCRoi  +  Roa  ~  Ria) 
c  =  ^i<^  -  (<^i  4-  <^2  ~  **13)* 

=  (doi  +  do2  +  di2)(— doi  +  do2  +  di2)(doi  —  doa  +  di2)(di)i  +  doa  -  di2) 

This  biquadratic  is  equivalent  to  the  one  originally  derived  by  Ullman.  But  Ullman  made  no 
attempt  to  interpret  or  decide  among  its  solutions,  which  will  be  done  here. 
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To  prove  existence  and  uniqueness,  the  biquadratic’s  solutions  must  be  examined.  We  are 
interested  only  in  positive,  real  solutions  for  s,  the  scale  factor.  In  general,  the  positive  solutions 
of  the  biquadratic  are  given  by 


Depending  on  the  radicands,  there  will  be  zero,  one,  or  two  real  solutions.  Particularly,  we 
are  interested  in  whether  each  number  of  solutions  can  arise,  and,  if  so,  to  what  the  solutions 
correspond  geometrically. 

To  begin,  let  us  determine  the  signs  of  a,  b,  and  c.  In  Fig.  2,  let  (f>  denote  the  angle  between 
nil  -  fiio  and  "ia  -  and  let  ^  be  the  angle  between  ii  -  t'o  and  12  -  to-  Notice  by  the  law 
of  cosines  that 


4i2gii2g2  ~  (2Roi-Ko2  cos 

4(iRoiRo3  sin  <f>y 

(24) 

2Roido2  ^'^ojdoi  ~  (2iZoi-Ro2Cos^){2<loido2CosV^) 

2(Roido2  +  ^2^01  -  2Jtoilto2<ioido2  cos  <l>cosil>) 

(25) 

4<^ido2  -  (2<ioido2  cos  rpy 

4(doido2sinV')^ 

(26) 

Further,  \R0tR02  sin^  equals  the  area  of  the  model  triangle,  so  that  o  measures  sixteen  times 
the  square  of  the  area  of  the  model  triangle.  Analogously,  c  measures  sixteen  times  the  square 
of  the  tirea  of  the  image  triangle. 

In  what  follows,  I  assmne  that  the  model  triangle  is  not  degenerate,  that  is,  not  simply  a 
line  or  a  point.  This  situation  is  the  only  time  the  solution  is  not  guaranteed  to  exist.  (For 
a  discussion  of  this  case  see  Section  4.5.)  Note  that  this  assun4>tion  implies  that  a  ^  0  and 

From  Equations  24  and  26,  clearly  a  >  0  and  c  >  0.  From  Equation  25,  it  is  straightforward 
to  see  that  fr  >  0: 

6  =  2(jRoid^  +  ~  2RoiIto7<iot^2  cos  ^  cos 

>  2(Aoido2  +  ^jdoi  -  2RoiRo2<iindo2)i  since  cos  ^  <  1,  cost/)  <  1 

=  2(iloido2  -  -Roadoi)’ 

>  0 

Returning  to  Equation  23,  Appendix  E  shows  that  6^  -  oc  >  0.  From  this  fact  and  that 
a  >  0,  6  >  0,  and  c  >  0,  we  can  derive  that  there  are  in  general  two  solutions  for  s  with  a  single 


special  case  when  —  ac  =  0,  which  can  be  seen  as  follows: 


fc’  -  ac  >  0 


Hence 


h  ±  \/h!^  —  ac  >  0, 
b  ±  y/h!^  —  ac 


since  6  >  0  and  ac  > 
since  a  >  0 


0 


■! 


h  ±  \/h^  —  ac 


which  gives  one  or  two  solutions  for  the  biquadratic,  depending  on  whether  b^  -  ac  Is  positive 
or  equal  to  zero. 

Next,  I  show  that  of  the  two  solutions  for  the  scale,  exactly  one  of  them  is  valid,  that  is,  cor¬ 
responds  to  an  orthographic  projection  of  the  model  points  onto  the  image  points.  Furthermore, 
the  other  solution  arises  from  inverting  the  model  and  image  distances  in  Fig  2.  In  addition, 
there  being  one  solution  for  scale  corresponds  to  the  special  case  in  which  the  model  triangle  is 
parallel  to  the  inu^e  pUme.  The  following  proposition,  which  is  proved  in  Appendix  C,  will  be 
useful  in  establishing  these  clsums. 

Proposition  1:  Let 


lb- y/lP-  ac  Ib+y/W 

s.  =  y - - -  s,  =  y - ^ 


ac 


Then 


doi  doa  - 

Si  < 

not  n^i 


(27) 


(28) 


4.1  The  true  solution  for  scale 

Here  it  is  shown  that  exactly  one  of  the  two  solutions  for  scale  can  satisfy  the  geometry  shown  in 
Fig.  2,  and  it  is  always  the  same  one.  If  the  two  solutions  are  the  same,  then  both  solutions  can 
satisfy  the  geometry  (this  case  is  discussed  in  Section  4.3).  As  will  be  seen,  the  valid  solution  is 


jb  -f-  -  ac 

'  =  V — ^ — • 

Note  that  proving  this  statement  establishes  the  existence  and  uniqueness  of  the  solution  given 
in  Section  3. 
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In  Fig.  2,  (aiZoi)^  ~  >  0  and  (5^02)*  ~  <^2  =  ^2  -  ®»  «^luch  implies  that  any 

solution  s  for  scale  satisfies 


^  <s  and 
iioi 


«f02 

Ro2 


Consequently,  Proposition  1  implies  that  S2  is  the  only  possible  solution.  Still,  the  question 
remains  whether  S2  is  itself  a  solution;  the  fact  that  it  satisfies  the  biquadratic  (Equation  22) 
is  not  sufficient  since  the  steps  used  to  derive  the  biquadratic  from  Equations  18-20  are  not 
always  reversible  due  to  the  squaring  used  to  obtain  Equation  21. 

Next,  I  show  that  S2  is  indeed  a  solution  by  giving  an  assignment  to  the  remaining  variables 
that  satisfies  the  constraints  in  Equations  18-20.  Since  (siioi)*  —  <foi  ^  0  (S.R02)*  -<lo2  ^  0, 

we  can  set  hj  =  (siioi)^  -  and  hi  =  (sjRo2)*  -  <^2>  which  inunediately  give  Equations  18 
and  19.  Furthermore,  we  know  s  satisfies  Equation  22,  or,  equivalently.  Equation  21.  Substitute 
hi  and  hi  into  the  left-hand  side  of  Equation  21: 

Ah\h\  =  +  -^02  ~  ^12)  ~  (^1  +  *^2  ~  ^12))  • 

which  is  the  same  as 

±2hih2  =  +  Rl^  -  Rl^)  -  (4^  +  4,-  4,). 


At  this  point,  we  are  free  to  choose  the  signs  of  hi  and  ^2.  In  particular,  let  the  sign  of  hi 
watch  the  sign  on  the  left-hand  side  so  that 


2hih2  —  s*(Eoi  +  ^2  “  ^12)  ■"  *^2  ~  ‘^12)’  (2®) 

Once  this  choice  is  made,  we  are  forced  to  choose  the  sign  of  h2  to  make  the  sign  of  the  left-hand 
side  consistent  with  the  right-hand  side.  In  particular,  let  a  be  the  sign  of  hi.  Then  unless  the 
right-hand  side  is  0, 


_  f  1  if  s^(<^i  -f  do2  —  <^2)  -^01  ^07  ~  •®12» 

ifsH4^  +  4,-4,)>Rii-i-Rl,-Rl,. 

On  the  other  hand,  if  s^{Ri2  -  ~  ^02)  =  <^2  ~  ~  ^2>  Equation  29  implies  hi  or 

hi  is  0,  so  that  the  sign  of  hj  is  not  forced  and  so  is  arbitrary.  Having  chosen  the  sign  of  hi, 
substituting  hf  and  h|  into  the  right-hand  side  of  Equation  29  gives 


2hih2  =  h?-»-hi-(s2ji;,-d?,), 


or 

(hi-h2)’  =  s’/2?2-<i?2, 


11 


Figure  4:  Geometrically  interpreting  the  inverted  solution  for  scale 


which  is  Equation  20. 

Returning  to  the  signs  of  h\  and  hi,  there  is  two-way  cunbiguity  in  the  sign  of  hi  which 
imposes  the  same  two-way  ambiguity  on  the  pairs  and  {Hi, Hi).  As  can  be  seen  in 

Fig.  2,  the  ambiguity  geometrically  corresponds  to  a  flip  of  the  plane  containing  the  space  points 
rho,  mi,  and  m}.  The  flip  is  about  a  plane  in  space  that  is  parallel  to  the  image  plane,  but 
which  plane  it  is  cannot  be  determined  since  the  problem  gives  no  information  about  offsets 
of  the  model  in  the  z  direction.  Due  to  the  reflection,  for  planar  objects  the  two  solutions  are 
equivalent,  in  that  they  give  the  same  image  points  when  projected.  On  the  other  hand,  for 
non-planar  objects  the  two  solutions  project  to  different  sets  of  image  points. 

There  is  a  special  case,  as  mentioned  above,  when  the  sign  of  is  arbitrary  relative  to  the 
sign  of  hi.  In  this  case,  the  right-hand  side  of  Equation  29  is  zero,  and  this  implies  that  hi  or 
hj  is  zero  also.  Looking  at  Fig.  2,  geometrically  what  is  occurring  is  that  one  of  the  sides  of  the 
model  triangle  that  emanates  from  lito  lies  parallel  to  the  image  plane,  so  that  the  reflective 
ambiguity  is  obtained  by  freely  changing  the  sign  of  the  non-zero  altitude. 

4.2  The  inverted  solution  for  scale 

Of  the  two  solutions  for  scale  that  satisfy  the  biquadratic,  we  know  that  one  of  them 
corresponds  to  the  geometry  in  Fig.  2,  but  what  about  the  other?  Using  a  similar  argument  to 
that  used  to  prove  S2  is  a  solution  for  the  weak-perspective  geometry,  we  can  infer  a  geometric 
interpretati<Hi  for  si.  Ck>nsider,  then,  s  =  si.  The  interpretation  I  will  derive  satisfies  the 
eqiiations. 


El  +  fij, 

=  (rdoi)’ 

(30) 

H2  iZoj 

=  (rdoj)’ 

(31) 

{Hi  -  H2f  +  R\2 

=  {rdi2)\ 

(32) 

where  r  =  ^.  Observe  that  ^  and  $2  have  similar  forms  (see  Equation  27): 


a 

b  —  -  ac 


r 
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To  begin  the  derivation,  Proposition  1  gives  that  dJi  -  (sRoi)^  >  0  and  >  0, 

which  implies  we  can  set  Aj  =  do}-(5i2oi)^  and  Al  =  do2-(«.Ro2)^*  Dividing  through  by  gives 
Equations  30  and  31.  As  before,  since  s  satisfies  Equation  22  and,  equivalently.  Equation  21, 
we  can  substitute  into  Equation  21  with  Af  and  Af  to  obtain 

(Ai  -  Aj)’  =  djj  -  s^rI^, 

where  the  sign  of  A3  relative  to  Ai  is  1  if  +  otherwise. 

Dividing  through  by  gives  Equation  32,  and  so  the  derivaticm  is  completed. 

Geometrically,  Equation  30  forms  a  right  triangle  with  sides  Hi  and  Aoi,  and  hypotenuse 
rdoi-  Analogously,  Equations  31  and  32  imply  right  triangles  as  well  The  interpretation  is 
displayed  in  Fig.  4.  Another  way  to  see  what  is  occxirring  geometrically  is  to  note  that  the  roles  of 
image  and  model  distances  from  Equations  18-20  are  inverted  in  Equations  30-32.  In  effect,  what 
is  happening  is  that  instead  of  scaling  down  the  model  triangle  and  projecting  it  orthographically 
onto  the  image  triangle,  the  image  triangle  is  being  scaled  up  and  projected  orthographically 
onto  the  model  triangle,  that  is,  projected  along  parallel  rays  that  are  perpendicular  to  the 
model  triangle.  This  interpretation  is  shown  in  Fig.  5  as  a  rotated  version  of  Fig.  4. 

4.S  Model  triangle  is  parallel  to  the  image  plane 

The  two  solutions  for  the  scale  factor  are  the  same  when  6^-oc  =  0,  and  here  I  demonstrate  that 
geometrically  this  corresponds  to  the  plane  contiuning  the  three  model  points  being  parallel  to 
the  image  plane.  Before  proving  this,  let  us  establish  the  existence  of  the  solution  for  scale  in 
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this  special  case  of  6*  -  oc  =  {?.  Looking  at  Equation  23, 

6*  -  ac  =  0  ==>■  h  ±  —  ac  =  b 

=>  3  = 

is  a  solution  to  the  biquadratic  since  a  >  0  and  b>  0. 

Appendix  D  shows  that  ft*  —  oc  =  0  exactly  when  <j>  =  itj}  or  4*  =  iV*  +  ^  and  ^ 
Using  this  result  and  Equations  24  and  26, 


3 


[dpidpa  sin  4>\ 
l^iiZoasin^l 


<foi  _  do2 

Roi  Ro2 


(34) 


=►  ^  =  0 

ha  =  ^(sEoa)*  -  <^oa  =  ®* 

Thus  -  oc  =  0  only  if  the  model  triangle  is  parallel  to  the  image  plane. 

Conversely,  if  the  model  triangle  is  parallel  to  the  image  plane,  it  must  be  that  <f>  =  i/>. 
Further,  in  this  case  hi  =  hj  =  0,  so  that 


<fei  _  <fea 
Roi  So2  ’ 


(35) 


which  from  Appendix  D  implies  that  6’  —  oc  =  0. 

Since  the  two  solutions  are  the  same,  we  know  that  3i  =  32  =  ^.  Notice  in  Figs.  3  and  4 
that  the  geometric  interpretations  for  the  two  solutions  for  scale  collapse  to  the  same  solution 
when  hi  =  h2  =  Hi  —  H2  =  9  and  «  =  7-  As  a  result,  when  there  is  one  solution  for  scale, 
there  is  also  one  solution  for  (hi,  h^)  and  {Hi,  Hi),  albeit  (0,0). 

4.4  Model  triangle  is  perpendicular  to  the  image  plane 

The  situation  where  the  model  triangle  is  perpendicular  to  the  image  plane  is  of  interest  since 
the  projection  is  a  line.  Note,  however,  that  the  solution  given  earlier  makes  no  exception  for 
this  case  as  long  as  the  model  triangle  is  not  degenerate.  As  for  what  happens  in  this  case, 
since  the  image  triangle  is  a  line,  we  know  ^  =  0  c  =  0  ==>  Equation  23  becomes 

lb±V^  ^  [u 

s-  y—  -  0,  y- .  (36) 
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Figure  6:  Special  case  where  model  triangle  is  a  line.  The  repeated  labels  correspond 
to  two  different  solutions  for  the  position  of  the  model  that  leave  smi  projecting  onto 
ti.  For  both  solutions  sm}  projects  onto  the  same  image  point. 


As  shown  above,  of  the  two  solutions  for  scale,  the  true  one  is  y  ^  and  the  inverted  one  is  0. 

To  see  why  the  inverted  solution  is  zero,  recall  that  the  solution  can  be  viewed  as  scaling 
and  projecting  the  image  triangle  onto  the  model  triangle,  using  for  scale  r  =  which  in  this 
case  does  not  exist.  Since  the  image  triangle  is  a  line,  graphically  this  amounts  to  trying  to 
scale  a  line  so  that  it  can  project  as  a  triangle,  which  is  not  possible. 

4.5  Model  triangle  is  a  line 

This  is  the  one  case  where  the  solution  for  the  scale  fails,  and  it  faOs  because  a,  which  is 
a  measure  of  the  area  of  the  model  triangle,  is  zero.  Despite  this  fact,  we  can  determine 
when  a  solution  exists.  First,  we  know  that  the  image  triangle  must  be  a  line  as  well.  To 
see  if  this  condition  is  enough,  consider  looking  for  a  3D  rotation  and  scale  that  leaves  »m\ 
orthogrs4>hically  projecting  onto  t'l  as  in  Fig.  6.  Observe  that  every  such  rotatimi  and  scale 
leaves  sma  projecting  onto  the  same  point  in  the  image.  This  means  is  that  for  a  solution  to 
exist,  it  must  be  that 

dpi  _  dpa 
Ron 

Even  when  the  image  triangle  is  a  line,  this  in  general  is  not  true.  When  it  is  true,  there  is  an 
infinity  of  solutions  corresponding  to  every  scaled  rotatiim  that  leaves  am\  projecting  onto  ti. 

Another  way  to  look  at  this  situation  is  to  notice  that  the  model  triangle  being  a  line  when 
using  the  true  solution  is  analogous  to  the  imi^e  trian^e  being  a  line  when  using  the  inverted 
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solution,  where  the  roles  of  the  model  and  image  triangles  are  reversed.  As  discussed  in  the 
previous  section,  the  image  triangle  is  a  line  when  the  model  triangle  is  perpendicular  to  the 
image  plane.  The  anrdysis  there  reveals  that  for  the  inverted  solution  the  scale  factor  r  is 
undefined,  which  means  that  here  the  true  solution  for  the  scale  factor  s  is  undefined  <is  well. 


4.6  Summary 

Our  goal  was  to  determine  the  three  unknown  parameters  of  the  geometry  displayed  in  Fig.  3, 
namely  s,  hi,  and  hj.  The  figure  gave  three  constrrunts  (Equaticms  18-20),  from  which  a 
biquadratic  in  the  scale  factor  s  was  derived.  The  biquadratic  always  has  two  positive  solutions, 
and  its  coefficients,  a,  b,  and  c,  are  all  non-negative.  Of  the  two  solutions.  Section  4.1  showed 
that  one  and  only  one  can  satisfy  the  three  constraints,  and  that  solution  is  s  =  sj  from 
Proposition  1  (see  Equation  27).  Given  s,  there  are  two  pairs  of  valid  assignments  for  hi  and 
h2‘  They  correspond  to  reflecting  the  plane  of  the  three  matched  model  points  about  any  plane 
parallel  to  the  image;  all  planes  parallel  to  image  plane  are  equally-good.  This  proved  that  the 
solution  for  3D  pose  exists  and  is  unique  up  the  reflective  ambiguity. 

In  Section  4.2,  Proposition  1  was  used  to  infer  the  geometry  that  gives  rise  to  the  other 
solution  to  the  biquadratic,  namely  s  =  s^  (Equation  27).  This  solution,  which  is  illustrated  in 
Fig.  5,  is  obtained  by  inverting  the  roles  of  the  model  and  image  points  in  Fig.  3.  The  difference 
with  the  inverted  solution  is  that  the  image  points  are  being  scaled  and  then  orthographicaUy 
projected  onto  the  model  points,  instead  of  the  reverse.  The  inverted  geometry  satisfies  three 
constraints,  Equations  30-32,  that  parallel  the  true  constraints  in  function  and  form.  Similarly, 
the  expression  for  the  scale  factor  of  the  inverted  solution,  ^  ~  (Ekjuation  33),  parallels  the 
expression  for  the  true  scale  factor,  s  =  si. 

Three  special  cases  were  discussed  next,  one  in  which  the  plane  of  the  matched  noodel  points 
is  parallel  to  the  image  plane  (Section  4.3),  one  in  which  it  is  perpendicular  to  the  image  plane, 
or,  equivalently,  in  which  the  matched  image  pmnts  are  collinear  (Section  4.4),  and  one  in  which 
the  matched  model  points  are  collinear  (Section  4.5).  The  first  case  is  the  one  and  only  situation 
in  which  the  two  solutions  collapse  to  the  same  one,  and  in  this  case  hi  =  hj  =  0.  In  addition, 
this  situation  is  exactly  where  the  two  solutions  to  the  biquadratic  are  the  same;  this  is  seen 
geometrically  by  looking  at  Figs.  3  and  4  with  hi,  hi.  Hi,  and  aU  sero  cmd  s  =  si  =  si  =  7. 

In  the  case  where  the  matched  image  points  are  collinear,  the  solution  for  3D  pose  is  still 
valid.  It  is  interesting  to  note,  however,  that  the  inverted  solution  for  the  scale  factor  does  not 
exist.  Yet  the  inverted  solution  for  the  scale  does  exist  when  the  model  points  are  collinear,  but, 
in  this  case,  the  true  solution  does  not.  Section  4.5  determined  that  when  the  model  points  are 
collinear  a  solution  for  3D  pose  may  still  exist,  but  if  and  only  if  a  further  constraint  is  satisfied. 
The  section  concludes  by  giving  the  constraint  and  describing  how  it  arises  geometrically. 
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5  Image  Position  of  a  Fourth  Model  Point 


To  compute  the  position  in  the  image  of  a  fourth  model  point,  I  first  use  the  solution  from 
the  previous  section  to  compute  its  3D  positicm  in  camera- centered  coordinates.  By  so  doing, 
I  can  project  the  camera-centered  model  point  under  weak-perspective  and  obtain  the  image 
position  without  having  to  calculate  a  model-to-image  transformation.  Let  the  image  points  be 
*0  =  (®o»yo)>  *1  =  and  ij  =  Given  s,  hi,  hj,  we  can  invert  the  projection  to 

get  the  three  model  points: 

rnb  = 

rni  =  ^(xi,yi,hi  +  w) 

mj  =  -I- tfl), 

where  w  is  an  unknown  offset  in  a  direction  normal  to  the  image  plane. 

Given  three  2D  points,  4ot  4ii  q^,  a  fourth  2D  point  ^  can  be  uniquely  represented  by 
its  ‘^affine  coordinates,”  (a,  J3),  which  are  ^ven  by  the  equation 

ft  =  a(^l  -  qo)  +  -  ft)  + 

Given  three  3D  points,  p^,  p{,  and  pi,  this  representation  can  be  extended  to  uniquely  represent 
any  other  3D  point  in  terms  of  what  1  shall  call  its  “extended  affine  coordinates,”  (a,  /?,  7), 
as  follows: 


ft  =  a(ft  -  ft)  +  -  1^)  +  7(ft  -  ft)  X  (ft  -  ^)  +  ft 


(37) 


Let 


*01  =  *1  -  *o»  fti  =  yi  —  yto> 

*03  =  *2  -  *o»  yo2  =  1/2  —  ft- 

Then,  using  the  three  model  points  with  po  =  riio,  pi  =  mi,  and  ft  =  m2. 


II 

1 

-(*oi»yoi>^i) 

3 

(38) 

ft  -ft  = 

-(*02»y03>^2) 

3 

(39) 

(ft  -  ft)  X  (6 -ft)  = 

-  yb3^i>*03^i  -  *01^21  *oiyo2  -  *o3yoi)  • 

(40) 

Next,  substitute  Equations  38-40  into  Equation  37  to  get  the  three-space  location  of  the  fourth 
point: 

m3  =  ja(*oi>ftl>^»)  +  “^(*03»y03»^2) 
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+7'^(yoiAa  -  yoa^it  +  xo2hi,xoiyo2  -  a;o2yoi)  +  -{xo^Voi^) 

s  s 

,  a  1  ~®01^2  +  *02^1  , 

oyol  +  0yo2  +  7 - +  yo. 

S 

L  ,  aL  ,  .  *0iy02  -  *02yoi  .X 

afti  +  /3/ij  +  7 - hto)  (41) 

3 

To  project,  first  apply  the  scale  factor  s: 

-  /  ,  a  ,  yoi^2  -  yo2^i  , 

«ni3  =  (azoi  +  pxo2  +  7 - +  *o» 

s 

,  a  -XQihi  +  ^02^1  . 

ayol  +  ^yo2  +  7 - !■  yo» 

3 

+  (42) 

Let  n  represent  an  orthogonal  projection  al<mg  the  z  axis.  Then  project  orthographically  to 
get  the  image  location  of  the  fourth  point: 

n(srn3)  =  (azoj  +  fixo2  +  7(yoi-H2  ~  IhnBx)  +  *o* 

ayoi  +  ^yo2  +  7(~*oi-i5r2  +  *02-ffi)  +  Vo)  (43) 

Notice  that  the  unknown  offset  w  has  dropped  out.  This  expression  computes  the  image  position 
of  Pi  from  its  extended  a£Sne  coordinates,  from  the  image  points,  and  from  Hi  and  H2,  the 
altitudes  in  the  weak-perspective  geometry.  There  are  no  intermediate  results  about  the  actual 
3D  pose  stored  along  the  way,  and  as  a  result,  this  computation  should  be  very  efficient. 
Nonetheless,  it  should  be  kept  in  mind  that  H\  and  H2  depend  on  the  specific  iimaging  geometry; 
that  is,  they  depend  on  the  pose  of  the  model. 

It  may  be  worthwhile  to  observe  that  Eipiation  43,  the  expression  for  the  fourth  point,  can 
be  rewritten  as  a  weighted  sum  of  the  three  image  points: 

n(sn^)  =  {axoi  +  0X(t2  +  7(yoi^2  —  +  *o» 

Qltoi  +  ^yo2  +  li—xoiHi  +  «02^i)  +  yo) 

=  (a*i  -I-  'iHtVi^ayx  -  7^2*1)  -  (a*o  +  7^2yo»ay6  -  7JJ2*o)  + 

{fht2  -  7Hiy2,Py2  +  iHiX2)  -  {pxo  -  iHiyo^^o  +  iHiXo)  -I- 
(«o*yo) 

_  1  -  Q  -  ^  7(fri  -  H2)  1  [  *0 1  , 

-  -^{Hl-H2)  l-a-0  J  1  ito  J 

a  jHi  1  f  *1  1  .  f  P  -iBi  1  [  X2  1 

-iB2  q  J  [  yi  J  [  7^1  ^  J  [  ys  J 
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Let  R#  represent  a  2D  rotation  matrix  that  rotates  by  an  angle  0.  Then 


n(sm3)  =  (44) 

where 

=  v^(l-a-^)»  +  (7(Ri-J?2))»  (45) 

h  =  +  (iHif  (46) 

h  =  yJP  +  (7^1  )*  (47) 

cos^o  =  sindo  = 

cos  ~  j-  sin  01  =  (48) 

COS0J  =  ^  sin0a  =  ^ 


Thus,  we  can  view  the  computation  as  a  2D  rotation  and  scale  of  each  image  point  separately 
followed  by  a  sum  of  the  three.  It  is  important  to  keep  in  mind,  however,  that  the  rotations 
and  scales  themselves  depend  on  the  image  pc^ts,  because  of  Hi  and  Hi. 

When  the  model  is  planar,  the  form  of  Equation  44  facilitates  understanding  the  effects 
of  error  in  the  image  points.  Error  in  the  locations  of  the  matched  image  points  leads  to 
uncertainty  in  the  image  location  of  the  fourth  model  point.  Suppose  that  the  true  locations  of 
the  matched  image  points  are  known  to  be  within  a  few,  say  Cj,  pixels  of  their  nominal  locations, 
for  i  =  0, 1,2.  Let  ti  and  Si  be  the  true  and  nominal  locations  of  an  image  point,  for  t  =  0, 1,2. 
Then,  for  some  eo,  to  =  ^  ;re  ||  eb  ||=  eo>  and  similarly  for  t’l  and  t2.  Then 

n(sm3)  =  ^oR|q*0  + 

=  (^oR#g<3o  +  0lR#jCl  +  03R«3<^)  +  (0oR#g^  + 

When  the  fourth  point  is  in  the  plane  of  the  first  three,  7  =  0,  so  that  the  scales,  and  0], 
and  2D  rotations,  Rag,  Raj,  and  R«2  ,  are  all  constant  (see  Equations  45-48).  This  means  that 
the  first  term  in  parentheses  is  just  the  notmnal  image  location  of  the  fourth  model  point.  Since 
Co,  ei,  and  ei  move  around  circles,  the  2D  rotatimis  in  the  second  term  can  be  ignored.  Further, 
since  these  error  vectors  move  independently  around  their  error  circles,  their  radii  simply  sum 
together.  Therefore,  the  region  of  possible  locaticms  of  the  fourth  model  point  is  bounded  by  a 
circle  of  raditu  0oCo  +  +  Sifi  that  is  centered  at  the  nominal  point.  By  plugging  7  =  0  into 

Equations  45-47,  we  get  that 

Sti  =  \\-  a-  ^\,  01  =  |a| ,  03  =  |/3| , 

Assuming  eg  =  ci  =  63  =  e,  this  implies  that  the  uncertainty  in  the  image  location  of  a  fourth 
point  is  bounded  by  a  circle  with  radius  (|1  -  a  -  /9|  -I-  |a|  -t- 1^|)6  and  with  its  center  at  the 
nominal  point,  which  repeats  the  result  given  earlier  by  Jacobs  [19]. 

Althou^  the  non-planar  case  clearly  is  more  complicated,  since  the  scales  and  2D  rotations 
are  no  longer  constant.  Equation  44  mi^  prove  useful  for  obtaining  bounds  on  the  effects  of 
error  in  this  situation  as  welL 
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6  Stability  of  the  3D  Pose  Solution 


In  numerical  computations,  it  is  well-advised  to  determine  whether  a  computation  is  stable, 
since,  if  not,  it  could  produce  inaccurate  results.  A  computation  is  unstable  if  any  roundoff 
error  can  propagate  and  magnify  such  that  the  true  answer  is  significantly  altered.  The  most 
common  source  of  roundoff  error  is  known  as  catastrophic  cancellation,  where  two  munbers  of 
nearly  equal  magnitudes  and  opposite  signs  are  summed.  In  fact,  catastrophic  cancellation  is 
the  only  way  a  sudden  loss  of  precision  can  occur  [31].  Otherwise,  in  general  precision  can  be 
lost  by  an  accumulation  of  small  errors  over  several  operations. 

In  the  3D  pose  solution,  there  are  a  few  subtractions  of  positive  numbers  to  be  wary  of.  In 
computing  h\  and  from  s  (Equation  7),  the  values  of  h-i  and  may  have  little  precision  if 
cancellation  occurs  in  the  radicands,  in  which  case  h\  or  will  be  small  relative  to  its  range 
of  values.  As  discussed  at  the  end  of  Section  4.1,  h\  or  ^2  is  zero  when  one  of  the  sides  of  the 
model  triangle  that  emanates  from  nio  lies  parallel  to  the  image  plane. 

The  calculation  of  hx  and  ht  can  also  be  unstable  if  s  Is  inaccurate.  Looking  at  Equation  6 
and  recalling  that  a,  b,  and  c  are  non-negative,  catastrophic  cancellation  can  only  occur  in  the 
inner  radicand.  Even  if  it  does,  this  is  not  a  problem,  since  the  result  of  the  square  root  would 
be  negligible  when  added  to  b. 

Another  way  for  s  to  become  inaccurate  is  if  the  value  of  a,  h,  or  c  in  Equation  6  is  obtained 
with  little  precision.  For  a  and  c,  Equations  9  and  11  show  in  parentheses  one  of  the  sides  of 
a  triangle  being  subtracted  from  the  sum  of  the  other  two;  therefore,  catastrophic  cancellation 
may  occur  when  the  triangle  is  nearly  a  line.  Equation  10  shows  that  cancellation  may  occur 
in  computing  b  if  either  the  terms  in  parentheses  or  the  total  sum  approaches  zero  relative  to 
their  ranges  of  values.  From  the  law  of  cosines,  the  terms  in  parentheses  are  near  zero  when 
some  angle  of  the  model  triangle  is  small.  From  Equation  25,  the  total  sum,  i.e.,  b,  is  small 
only  if  certiun  angles  in  the  model  and  inoage  triangles  are  snudl  also.  This  says  we  should  be 
careful  of  5  in  the  same  circumstances  in  which  we  are  careful  of  a  and  c,  namely,  when  the 
model  or  image  points  are  nearly  collinear. 

To  conclude,  the  parameters  s,  hi  and  A2  (ot  s.  Hi,  and  Hi)  are  prone  to  instability  when 
the  matched  model  or  image  points  are  almost  collinear,  and,  additionally.  Hi  or  H^  can  be 
unstable  when  one  of  the  vectors  from  mo  to  mi  or  m2  is  nearly  parallel  to  the  image.  In  the 
latter  case,  the  unstable  Hi  or  H2  is  close  to  zero.  If  only  one  of  Hi  and  H2  is  close  to  zero, 
then  the  instability  can  be  avoided  by  re-ordering  the  matched  points  to  make  both  Hi  and 
H2  large.  However,  if  this  is  done,  the  difference  Hi  -  H2  will  be  dose  to  zero  and  may  be 
impredse.  If  both  Hi  and  H2  are  almost  zero,  which  means  the  model  triangle  is  nearly  parallel 
to  the  inoage,  then  re-ordering  the  matched  points  will  not  help. 

Finally,  it  is  worth  observing  that  much  of  the  instability  in  the  pose  solution  occurs  at 
places  in  which  the  problem  is  ill  conditioned,  that  is,  places  where  instability  is  inherent  in 
the  geometry.  For  instance.  Hi  was  said  to  be  unstable  when  the  vector  from  mo  to  mi  is 
nearly  parallel  to  the  image.  Geometrically,  in  this  situation  a  small  change  in  the  position  of 
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t'l  can  cause  a  large  change  in  the  altitude  Hi  (Fig.  2).  For  the  same  reason,  recovering  the 
altitude  Hi  is  unstable  when  the  vector  from  nio  to  m3  is  nearly  parallel  to  the  image.  This 
situation  would  be  worse  if  both  vectors  emanating  from  mo  were  parallel  to  the  image.  By 
a  similar  argument,  it  is  intrinsically  imstable  to  recover  the  pose  when  the  model  points  are 
nearly  collinear,  due  to  there  being  an  infinity  of  solutions  when  the  model  points  are  exactly 
collinear  (Section  4.5). 

This  suggests  that  recognition  systems  like  alignment  and  pose  clustering  should  give  special 
attention  to  situations  where  the  model  triangle  is  almost  a  line  and  where  the  model  triangle 
being  viewed  straight  on.  These  cases  could  be  avoided  by  checking  if  the  model  points  are 
nearly  collinear  or  if  the  corresponding  angles  between  the  model  and  image  points  are  very 
close.  For  the  latter  case,  the  suggestion  does  not  apply  if  alignment  is  being  used  to  recognize 
planar  models.  This  is  because,  if  Equation  14  is  used,  error  in  Hi  or  H2  has  no  effect  on  the 
image  locations  of  points  in  the  plane,  since  for  these  points  7  =  0. 


7  Review  of  Previous  Solutions 

There  have  been  several  earlier  solutions  to  the  weak-perspective  three- point  problem,  notably 
by  Kanade  and  Kender  [20],  Cyganski  and  Orr  ([7],  [8]),  Ullman  ([28],  [17]),  Huttenlocher 
and  Ullman  ([16],  [18],  [29]),  and  Grimson,  Huttoolodher,  and  Alter  [12].  All  the  previous 
solutions  compute  the  3D  pose  by  going  through  a  3D  rigid  transformation  or  a  2D  affine 
transformation  relating  the  model  to  the  image.  A  2D  affine  transform  is  a  linear  transform 
plus  a  translation,  and  it  can  be  applied  to  any  object  lying  in  the  plane.  All  but  Ullman’s 
and  Grimson,  Huttenlocher,  and  Alter’s  solutions  compute  w  affine  transformation  between 
the  three  model  and  image  points.  Also,  all  but  Kanade  and  Render’s  solution  compute  a 
model-to-image  rigid  transformation,  either  via  a  rotation  matrix  or  via  Euler  angles. 

Not  all  of  the  solutions  directly  solve  the  weak-perspective  three-point  problem.  The  earliest 
solution,  which  was  given  by  Kanade  and  Kender  in  1983,  applies  Kanade’s  skewed-symmetry 
constrmnt  to  recover  the  3D  orientation  of  a  symmetric,  planar  pattern  [20].  More  precisely, 
Kanade  and  Kender  showed  how  to  compute  the  3D  orientation  of  the  plane  containing  a 
symmetric,  planar  pattern  from  a  2D  affine  transform  between  an  image  of  the  pattern  and 
the  pattern  itself.  To  apply  this  result  to  the  weak-perspective  three-point  problem,  the  three 
points  can  be  used  to  construct  a  symmetric,  planar  pattern,  and  a  2D  affine  transform  can  be 
computed  from  two  sets  of  three  corresponding  points.  The  solution  was  shown  to  exist  and  to 
give  two  scdutions  related  by  a  reflective  ambiguity,  assuming  that  the  determinant  of  the  affine 
transform  is  positive. 

The  remaining  methods  all  concentrate  on  computing  the  3D  ripd  transform  from  the  model 
to  the  image.  In  1985,  while  presenting  a  system  for  recognizing  planar  objects,  Cyganksi  and 
Orr  showed  how  to  use  hi^er-order  moments  to  compute  a  2D  afiSne  transform  between  planar 
regions  ([7],  [8]).  Given  the  affine  transform,  they  listed  expressions  for  computing  the  3D 
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Euler  angles  from  the  2D  affine  transform^.  They  did  not,  however,  discuss  how  they  derived 
the  expressions. 

The  next  method  is  the  solution  given  by  UUman  in  1986  [28],  which  appeared  again  in  [17]. 
The  paper  included  a  proof  that  the  solution  for  the  scale  factor  is  unique  and  the  solution  for 
the  rotation  matrix  is  unique  up  to  an  inherent  two-way  ambiguity.  (This  corresponds  to  the 
ambiguity  in  Hx  and  But  UUman  did  not  show  the  solution  exists.  When  it  does  exist, 
Ulhnan  described  a  method  for  obtaining  the  rotation  matrix  and  scale  factor. 

In  1988,  Huttenlocher  and  UUman  gave  another  solution,  and,  in  the  process,  gave  the 
first  complete  proof  that  the  solution  both  exists  and  is  unique  (up  to  the  two-way  ambiguity) 
([16],  [18],  [29]).  Like  Kanade  and  Kender,  and  Cyganski  and  Orr,  Huttenlocher  and  UUman ’s 
solution  reUes  on  a  2D  affine  transform.  The  solution  itself  is  based  on  algebraic  constraints 
derived  from  rigidity,  which  are  used  to  recover  the  elements  of  the  scaled  rotation  matrix. 

The  last  solution,  which  was  published  this  year,  was  developed  by  Crimson,  Huttenlocher, 
and  Alter  for  the  purpose  of  analyzing  the  effects  of  image  noise  on  error  in  transformation 
space  [12].  Towards  this  end,  the  method  facilitates  computing  how  a  smaU  perturbation  in 
each  transformation  parameter  propagates  to  uncertainty  ranges  in  the  other  parameters. 


8  Presentation  of  Three  Previous  Solutions 

The  solutions  discussed  in  the  previous  section  differ  significantly  in  how  they  compute  the 
transformation,  and,  as  a  result,  each  one  can  provide  different  insights  into  solving  related 
problems,  such  as  error  analysis  in  alignment-based  recognition  and  pose  clustering.  It  seems 
useful,  then,  to  present  the  previous  solutions  in  detail,  so  they  conveniently  can  be  referred  to 
and  compared. 

The  first  method  presented  is  Ullman’s  solution,  which  the  first  part  of  this  paper  extended. 
Alter  that,  I  give  Huttenlocher  and  Ullman’s  solution.  Lastly,  I  present  the  method  of  Crimson, 
Huttenlocher,  and  Alter.  I  do  not  present  Kanade  and  Kender’s  method  nor  Cyganski  and  Orr’s, 
because  Kanade  and  Kender  did  not  directly  solve  the  weak-perspective  three-point  problem, 
and  Cyganski  and  Orr  did  not  detail  their  solution. 

It  should  be  pointed  out  that  the  presentations  here  differ  somewhat  from  the  ones  given 
by  the  original  authors,  but  the  ideas  are  the  same.  Basically,  the  presentations  emphasize  the 
steps  that  recover  the  3D  pose  while  being  complete  and  concise.  For  more  details,  the  reader 
is  referred  to  the  original  versions  in  the  references. 

In  the  following  presentations,  we  are  looking  for  a  rigid  transform  plus  scale  that  aligns 
the  model  points  to  the  image  points.  In  all  methods,  we  are  free  to  move  ripdly  the  three 
image  points  or  the  three  model  points  wherever  we  wish,  since  this  amounts  to  tacking  on  an 
additional  transform  before  or  after  the  aligning  one.  For  example,  this  justifies  the  assumption 
made  below  that  the  plane  of  the  model  points  is  parallel  to  the  image  plane. 

‘The  expreerions  that  appear  in  [7]  contain  typesetting  errors,  but  are  listed  correctly  in  [8]. 
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For  consistency,  the  same  notation  as  in  Sections  3  and  4  is  used  in  the  proofs  that  follow: 
Let  the  model  points  be  mo,  mi,  mj  and  the  image  points  be  to,  t'l,  tj,  with  the  respective 
distances  between  the  points  being  Rqi,  Rq2,  and  R12  for  the  model  points,  and  doi,  do2,  and 
di2  for  the  image  points. 


8.1  Overview 

This  section  provides  an  overview  of  the  three  methods. 

Initially,  all  three  methods  compute  a  transformation  that  brings  the  model  into  image 
coordinates,  such  that  the  plane  of  the  three  matched  model  points  is  parallel  to  the  image 
plane  and  such  that  mo  projects  onto  to,  which  has  been  translated  to  the  origin.  The  three 
methods  then  compute  the  out-of-plane  rotation  and  scale  that  align  the  matched  model  and 
image  points.  In  so  doing,  the  methods  aU  end  up  solving  a  biquadratic  equation. 

In  Ullman's  method,  the  model  and  image  points  are  further  transformed  via  rotations 
around  the  2  axis  to  align  mi  and  I'l  along  the  x  axis.  Then  the  3D  rotation  matrix  for  rotating 
successively  around  the  x  and  y  axes  is  expressed  in  terms  of  Euler  angles.  This  leads  to  a 
series  of  three  equations  in  three  unknowns,  which  are  solved  to  get  a  biquadratic  in  the  scale 
factor.  To  get  the  elements  of  the  rotation  matrix,  the  solution  for  scale  factor  is  substituted 
back  into  the  ori^al  three  equations. 

Instead  of  further  rotating  the  model  and  image  points,  Huttenlocher  and  UUman  compute 
an  aflSne  transform  between  them,  which  immediately  gives  the  top-left  sub-matrix  of  the  scaled 
rotation  matrix.  Then  by  studying  what  happens  to  two  equal-length  vectors  in  the  plane,  a 
biquadratic  is  obtained.  The  scale  factor  and  the  remaining  elements  of  the  scaled  rotation 
matrix  are  found  using  the  algebraic  constraints  on  the  columns  of  a  scaled  rotation  matrix. 

Like  UUman  did.  Crimson,  Huttenlocher,  and  Alter  rotate  the  model  further  to  align  mi 
and  t'l.  The  desired  out-of-plane  rotation  is  expressed  in  terms  of  two  angles  that  give  the 
rotation  about  two  perpendicular  axes  in  the  plane.  Next,  Rodrigues’  formula,  which  computes 
the  3D  rotation  of  a  point  about  some  axis,  is  used  to  eliminate  the  scale  factor  and  obtain  two 
constraints  on  the  two  rotation  angles.  The  two  constriunts  are  solved  to  get  a  biquadratic  in 
the  cosine  of  one  of  the  angles.  Its  solution  is  substituted  back  to  get  the  other  angle  and  the 
scale  factor,  which  can  be  used  Erectly  by  Rodrigues’  formula  to  transform  any  other  model 
point. 

As  mentioned  in  the  introduction,  Ullman’s  solution  is  incomplete  because  it  does  not  show 
which  of  the  two  solutions  for  the  scale  factor  is  correct;  actuaUy,  the  solution  is  completed  by  the 
result  given  in  Section  4.1  of  this  paper.  Similar  to  Ullman’s  method.  Crimson,  Huttenlocher, 
and  Alter’s  solution  has  the  same  drawback  of  not  showing  which  solution  to  its  biquadratic  is 
correct.  Huttenlocher  and  UUman,  on  the  other  hand,  have  no  such  problem  because  it  turns 
out  that  one  of  the  two  solutions  to  their  biquadratic  is  obviously  not  real,  and  so  it  immediately 
is  discarded. 
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8.2  UUman’s  method 


This  section  gives  Ullman’s  solution  to  the  weak-perspective  three-point  problem.  The  main 
idea  is  first  to  transform  the  three  model  points  to  the  image  plane  and  then  solve  for  the  scale 
etnd  out-of-plane  rotation  that  align  the  transformed  points. 


Specifically,  the  model  points  first  are  rigidly  transformed  to  put  the  three  model  points  in 
the  image  plane  with  tuo  at  the  origin  of  the  image  coordinate  system  and  m\—mQ  aligned  with 
the  X  axis.  After  rigidly  transforming  the  model  points,  the  resulting  points  can  be  represented 
by  (0,0,0),  (*1,0,0),  and  (*2,jrj,0).  Similarly,  let  the  image  points  be  rigid  transformed  to 
put  t'o  the  origin  and  t'l  -  I'o  along  the  x  axis,  and  let  the  resulting  image  points  be  (0,0,0), 
(*1,0,0),  and  (*j,y2,0). 


Next,  we  break  the  out-of-plane  rotation  into  a  rotation  around  the  z  axis  by  an  angle  d 
followed  by  a  rotation  around  the  y  axis  by  an  an^e  (^,  as  pictured  in  Fig.  7.  The  corresponding 
rotation  matrix  is 


R  = 

cos  4>  0  sin  <t> 

0  1  0 

10  o' 

0  cos$  —  sin0 

—  sin  <f>  0  cos  4> 

0  sin  d  cos  $ 

COS0  sin^sind  sm^cosd 
0  cosd  -sind 

-sin^  cos^sind  cos^cos^ 


(49) 


After  rotation  and  scale,  (0,0,0),  (*i,0,0),  and  (*2,y2,0)  become  (0,0,0),  (*i,0,ri),  and 
(*2,y2,*2))  respectively,  where  zi  and  Z2  are  unknown.  Thus,  we  need  to  find  0,  <f>,  and  s 
such  that 


sR(zi,0,0)  =  (xi,0,zi) 
sR(*2,y2,0)  =  (*2,y2,«j) 

Expanding  the  first  two  rows  of  R  yields  three  equations  in  three  unknowns: 


s*i cos  0  = 

*1 

(50) 

s^cos9  = 

yi 

(51) 

szacos^  — sj^sin^sind  = 

*3 

(52) 

Fig.  7  gives  a  graphical  interpretation  of  the  first  two  equations.  Substituting  Equations  50 
and  51  along  with  expressions  for  sin^  and  sin  9  into  Equation  52  yields  a  biquadratic  in  the 
scale  factor  s: 


as*  -  bs^  -b  c  =  0, 

(53) 

where 

a  =  zi’** 

(54) 

b  =  x\{x2^  ■¥  yi^)^■x^^{^\■\-y\)-2zxx■iXl£2 

(55) 

cqtN 

1) 

(56) 
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The  positive  solutions  for  s  are  given  by 


[6 i 

2^ 


:  i  1 


(57)  • 


In  general  there  can  be  one,  two,  or  no  solutions  for  s.  UUman  makes  no  further  attempt  tof 
determine  when  or  if  each  solution  arises,  except  to  refer  to  a  uniqueness  proof  he  gives  earlier 
in  the  paper.  The  uniqueness  proof  implies  there  can  be  at  most  one  solution  for  s,  but  does^ 
not  say  which  solution  it  is  or  whether  it  can  be  either  one  at  different  times. 

Given  s,  the  rotation  matrix  R  is  obtained  using  cos  ^  ^  and  cos  0  =  ^va.  Equation  49. 

One  difficulty  with  this  is  that  we  do  not  know  the  signs  of  sind  and  sin^;  this  leaves  four- 
possibilities  for  the  psur  (sin  0,  sin  ^).  In  his  uniqueness  proof,  UUman  points  out  that  thet 
inherent  reflective  ambiguity  corresponds  to  mriltiplying  simultfineously  the  elements  ris,  c. 
rai,  and  rsa  of  R  by  -1.  In  Equation  49,  the  signs  of  those  elements  also  are  inverted  whenv-r 
both  sin  9  and  sm<t>  are  multiplied  by  —1,  which,  visuaUy,  corresponds  to  reflecting  the  model 
points  about  the  image  plane  (Fig.  7).  Still,  we  have  no  way  to  know  which  of  the  two  pairs  of 
solutions  is  correct.  One  way  to  proceed  is  to  try  both  and  see  which  solution  pair  aligns  the 
points. 


8.3  Huttenlocher  and  Ullman’s  method 

V 

First,  assume  the  plane  contuning  the  model  points  is  pwaUel  to  the  image  plane.  Then^ 
subtract  out  mo  and  to  from  the  model  and  image  points,  respectively,  to  align  them  at  the 
origin.  Let  the  resulting  model  points  be  (0,0,0),  (ei,]ri,0),  and  (z3,^,0),  and  the  resulting 
image  points  be  (0,0),  (zi,yi),  and  (zstPa).  At  this  point,  what  is  left  is  to  compute  the  scaled  , 
rotation  matrix  that  brings  («i,yi,0)  and  (xii, 2/3,0)  to  (zi,!/i,2i)  and  (23, 2/3,  23),  respectively. 
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where  zi  and  Z2  are  unknown.  That  is,  we  need 

sR(*i,jri,0)  = 

sR(*”2,  2/2,0)  =  (x2,1/2,23). 

Letting  In  =  srn,  I12  =  'Sri2,  etc.,  and  focusing  on  the  first  two  rows  of  the  rotation  matrix, 
we  get  two  sets  of  equations: 


^11*1  +  hiVi 

=  *1 

(58) 

hi^i  +  hiVi 

=  *2 

(59) 

h\x\  -f-  hiVi 

=  yi 

(60) 

hiX2  +  hiVi 

=  yi, 

(61) 

which  give 


,  the  top-left  sub-matrix  of  the  scaled  rotation  matrix.  Note  that  this 


I12 
hi  hi 

step  fails  if  the  determinent,  :Bi2r2  —  zslh,  equals  zero. 


Next,  we  make  a  digression  to  consider  what  happens  to  two  orthogonal,  equal-length  vectors 
in  the  plane,  el  and  el.  Since  el  and  el  are  in  the  plane,  we  can  apply  the  sub-matrix  just 
computed  to  obt^  the  resulting  vectors,  el'  and  ej': 


cl' 


■  hi 

hi 

hi  hi 

hi 

hi 

Cl, 

e2  — 

hi  in 
>  « 

el 


(62) 


When  a  model  is  transformed,  el  and  el  undergo  a  rigid  transformation  plus  scale  before 
projection.  As  shown  in  Fig.  8,  after  transformation  these  vectors  become  el'-l-ciz  and  e2'+C2Z. 
Since  a  scaled,  rigid  transform  preserves  angles  and  ratios  of  lorgths  between  vectors,  and  since 
el  •  el  =  0  and  ||  el  ||=||  el  |l,  it  must  be  that 

(el'  +  ciz) '  (el'  +  C2z)  =  0 

II  «r  II +A  =11  eV  II  +cl. 

These  two  equations  sinqilify  to 

C1C2  =  ki 

c\-cl  =  k2 

where 

ki  =  -el'  •  el' 
ki  =  II  el' II  -  II  el' II 
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(63) 

(64) 


Fig\ue  8:  Projecting  two  orthogonal  same-length  vectors  in  Huttenlocher  and  Ullman’s 
method. 


Substituting  for  cj  =  ^  in  the  second  equation  leads  to  a  biquadratic  in  Ci; 

c}  -  kic\  -  =  0  (65) 

The  general  solution  is 

Conveniently,  the  inner  discriminant  always  is  greater  than  or  equal  to  zero.  Furthermore,  since 
Akl  >  0,  the  real  solutions  are  (pven  by 

Cl  =  ±^-  +  ^*2  +  4^^  ,  (66) 

since  otherwise  the  outer  discriminant  is  less  than  zero. 

These  two  solutions  for  Ci  give  two  corresponding  solutions  for  C2,  which  from  Fig.  8  can  be 
seen  to  correspond  to  a  reflection  about  the  image  plane. 

The  solution  for  cj  does  not  work  when  ci  =  0.  In  this  case. 


cj  =  ±y/-ki  =  ±^||  e\  II  -  II  eV  II .  (67) 

This  gives  two  solutions  for  C3,  if  it  exists,  which  can  be  seen  as  follows.  Since  ci  =  0,  ej  ends 
up  in  the  plane,  so  that  that  the  length  of  ei  is  just  scaled  down  by  s,  whereas  the  length  of 
C}  reduces  both  by  being  scaled  down  and  by  projection.  Consequently,  ||  ej'  ||<||  el'  ||,  and, 
therefore,  cj  exists. 
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Given  c\  and  C2,  we  can  recover  two  more  elements  of  the  scaled  rotation  matrix.  Since  e\ 
and  ej  are  in  the  plane,  we  know  that  sRei  =  €\  +  ciz  and  sRe2  =  Focusing  on  the 

last  row  of  the  scaled  rotation  matrix,  we  get  the  two  equations  /31  =  cj  and  I32  =  C2. 

At  this  point,  we  have  the  first  two  columns  of  sR,  and,  from  the  constraints  on  the  columns 
of  a  rotation  matrix,  we  can  get  the  last  column  from  the  cross  product  of  the  first  two.  In 
total,  this  gives 


I12 

^(02/21  -  Ci/22) 
i(cili2  -  C2/11) 

hi 

h2 

Cl 

C2 

f(hih2  —  ^12^21) 

(68) 


Since  the  columns  of  a  rotation  matrix  have  unit  length,  we  know 


Notice  that  the  ambiguity  in  ci  and  C2  inverts  the  signs  of  the  appropriate  elements  of  the 
rotation  matrix  as  discussed  in  Section  8.2. 

8.4  Grimson,  Huttenlocher,  and  Altar’s  method 

Crimson  et  al.  gave  another  solution  to  the  weak-perspective  three  point  problem  in  order  to 
get  a  handle  on  how  small  perturbations  affect  the  individual  transformation  parameters. 

To  start,  assume  the  plane  containing  the  model  points  is  parallel  to  the  image  plane.  Next, 
rigidly  transform  the  model  points  so  that  mo  projects  to  to  and  mi  -  mo  projects  along  t'l  -  to. 
Let  n  represent  an  orthogonal  projection  along  the  z  axis,  and  in  general  let  v-^  be  the  2D 
vector  rotated  ninety  degrees  clockwise  from  the  2D  vector  v.  Then  the  translation  is  to  -  Ilmo, 
and  the  rotation  is  about  z  by  an  angle  ^  given  by 

cos  =  n*oi  •  ^91,  sin  V’  =  -w*oi  *  *or 


(see  Fig.  9). 

At  this  point,  assign  tiioi  =  ~~  *Tio3  =  m2  ~  *01  =  *1  ~  *o»  *oj  =  ~ 

Also,  consider  the  out-of-plane  rotation  to  be  a  rotation  about  toi  by  some  angle  0  followed  by 
a  rotation  about  tgi  by  some  angle  <f>.  Let  us  compute  where  the  vectors  toi  and  igi  project  to 
after  the  two  rotations  and  scale.  To  do  this,  we  use  Rodrigues’  formula:  Let  R^^^F  represent 
a  rotation  of  a  point  p  about  a  direction  o  by  an  angle  r.  Rodrigues’  formula  is 

R^.^p  =  cosTp+  {1  -  co8t)(u  •  p)v  +  sinT(9  x  p).  (70) 
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Using  the  formula,  we  can  compute 

=  cos^i-sin^z  (71) 

I^x  -*01  =  sin  ^  sin  +  cos  0»oi  +  sin  ^  cos  (jn. 

»oi*v  »«»<• 

Initially,  ntoi  was  rotated  about  z  to  align  it  with  t'oi*  In  order  for  the  scaled  orthographic 
projection  of  nioi  to  align  with  toi.  Equation  71  implies  that 

^  ^  11^1  II  1 
II  »j*01  II  cos  <l> 


_  doi  1 

Roi  cos<ff' 

(72) 

Then 

sIIRrrx  -*01  — 

»oi«4  *01.#^* 

-5— *01 
•Koi 

(73) 

«oi*4  ««i.s 

^  (singsin<^i  +  cosW^i) 

AQX  COS  <p 

(74) 

Next,  we  use  the  expressions  in  Equations  73  and  74  to  constrain  0  and  such  that  nioz 
projects  along  ioj.  When  we  aligned  fi^i  and  toi*  rotated  to  ^ 

component  (by  assumption),  we  can  represent  R2;^mo2  by 

Ro2  cos  ^ioi  +  ^2  sin  (»oi, 
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where  ^  is  a  known  angle.  Consequently,  the  transformed,  projected,  and  scaled  mo2»  which 
must  equal  t'os,  is 


iR.'-  « 


(Jto3  cos  1^01  +  Ro2  sin^i^) 

=  Ro3  cos  ^(snRj-j.  ^  ^toi )  +  ^02  sin ^(snRj-x  ^^R^^  j*oi ) 


=  Ro3  cos  ( 


(^*01^  +  Ro2sin^  ^  cos0i^i)^ 

=  -^^^^^(cos^  cos<^  +  sin  ^  sin  ^  sin  0)toi  +  -^~~^(sm(cos0)in,. 
cos^Roi  cos^Roi 


Similar  to  R^^^ino29  we  can  represent  to2  us 


*02  =  ^2  coswtoi  +  doj  sina;toit 


where  u>  is  known.  By  equating  terms  we  get 
doi  -1202, 


do2  ^01 


(cos  ^  cos  ^  +  sin  (sin  ^  sin  0)  =  cos^cosu; 


doi  •Ro2 


(sin  (  cos  0)  =  cos  ^  sin  u;. 


do2  jRoi 

These  two  equations  can  be  solved  to  get  a  biquadratic  in  cos 


sin*u>cos^  ^  —  (t*  +  1  —  2t  cosb;cos()cos^  ^  +  t*  sin*(  =  0, 


(75) 

(76) 


(77) 


where 

_  Roi^oi 
Roidoj 

Since  is  aligned  with  toil  need  cos  ^  to  be  positive  so  that  nioi  projects  in  the  same 

direction  as  t'oi.  The  positive  solutions  are  given  by 


COS0  = 


sin’a»sin^( 


(79) 


with 

V  =  i(l  +  t*  -  2tco8u>cos(). 

A 

This  equation  gives  up  to  two  solutions,  but  Grimson  et  al.  make  no  farther  attempt  to  show 
which  solutions  exists  when,  except  to  say  the  equation  pves  real  solutions  only  if  v  >  0  or 


cos  u;  cos  (  < 


l±i! 

2t  ■ 


(80) 
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Given  Equations  75  and  76  provide  6-. 


cos  9 
sin  9 


sin  u;  cos  ^ 
tsin^ 

cos<f>{cos<j  -  f  cos^) 
t  sin  ^  sin  0 


(81) 

(82) 


Given  any  model  point  m,  we  can  use  the  computed  angles  along  with  Rodrigues’  formula 
to  find  its  image  location.  In  particular,  once  nio  ai^d  to  have  been  subtracted  out,  only  the 
scale  and  3D  rotation  are  left.  The  scale  is  given  by  Equation  72,  and,  as  shown  above,  the 
rotation  is 


,9  *01 


(83) 


As  with  UUman’s  method  (Section  8.2),  we  do  not  know  the  signs  of  sind  and  sin^,  but  only 
that  inverting  both  signs  simultaneously  corresponds  to  the  reflective  ambiguity. 

8.5  Summary  of  the  three  computations 

Here  I  summarize  how  each  method  can  be  used  to  compute  3D  pose  from  three  corresponding 
points.  To  begin,  transform  the  model  and  inoage  points  so  that  (1)  the  model  points  lie  in  the 
image  plane,  (2)  tuq  and  to  arc  at  the  origin  of  the  image  coordinate  system,  and  (3)  mi  -  nio 
and  ti  -  to  lie  along  the  x  axis.  Then  use  one  of  the  three  methods  to  compute  the  scale  factor 
and  out-of-plane  rotation,  as  follows: 

•  UUman’s  method 

1.  Use  Equations  54-56  to  get  a,  6,  and  c. 

2.  Substitute  <t,  6,  and  c  into  Equation  57  to  get  s. 

3.  Calculate  cos  <f>  =  -^  and  cos  9  = 

4.  Calculate  sin ^  =  y/1  —  cos*  ^  and  sin9  =  Vl  -  cos^9. 

5.  Construct  the  rotation  matrix  R  using  Equation  49. 

•  Huttenlocher  and  UUman’s  method 

1.  Sdve  Equations  58  and  59  for  In  and  I12,  and  Equations  60  and  61  for  I21  and  I33. 

2.  Let  ei  =  (0,1)  and  el  =  (1,0).  (Any  orthogonal,  equal-length  vectors  can  be  used.) 

3.  Use  Elation  62  to  get  ei'  and 

4.  Substitute  el'  and  el'  into  Equations  63  and  64  to  get  ki  and  k2. 

5.  Substitute  kt  and  ks  into  Equation  66  to  get  ci. 
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6.  If  Cl  ^  0,  calculate  C2  =  Otherwise  get  cj  from  Equation  67. 

7.  Use  Equation  69  to  get  s. 

8.  Use  Equation  68  to  get  sR.  Divide  through  by  s  if  R  is  desired  instead  of  sR. 

•  Crimson,  Huttenlocher,  and  Alter’s  method 

1.  From  the  model  points,  compute  Roi<  Ro2  from  the  image  points,  com¬ 

pute  dial,  db2,  and  u. 

2.  Use  Equation  78  to  get  t. 

3.  Use  Equation  79  to  get  cos  4>- 

4.  Use  Equation  72  to  get  s. 

5.  Calculate  sin  ^  =  \/l  -  cos*^. 

6.  Use  Equations  81  and  82  to  get  cosd  and  sin0. 

7.  To  transform  any  point  p,  substitute  cos^,  sin^,  cos$,  sint),  and  pinto  Rodrigues’ 
formula.  Equation  70,  to  get  Rp  =  Rr-j.  .R-r  ^p. 

9  Stability  and  Comparison  of  Three  Previous  Solutions 

For  computing  3D  pose,  it  is  desirable  to  know  how  the  solutions  conqtare  in  terms  of  stability. 
To  address  this  issue,  let  us  examine  how  susceptible  the  solutions  are  to  catastrophic  cancella¬ 
tion  [31].  For  ease  of  reference,  I  will  indicate  which  steps  in  the  pose  computation  summaries 
of  Section  8.5  may  be  unstable. 

Ulhnan’s  solution  computes  s  in  the  same  way  as  this  paper  does,  and,  as  a  result,  is  unstable 
at  the  same  places  (see  Section  6).  For  instance,  precision  may  be  lost  if  the  model  or  image 
pmnts  are  nearly  coUinear  when  computing  the  coefficients,  a,  6,  and  c,  of  the  biquadratic.  (In 
Section  8.5,  this  is  step  1  of  Ulhnan’s  solution.)  Looking  for  a  moment  at  Ullinan’s  computation 
of  a  and  c,  it  may  appear  that  the  computation  is  stable  since  there  is  no  addition  in  Equation  54 
or  56.  In  actuality,  instability  is  hidden  in  the  initial  transformation  that  aligns  the  model  and 
image  with  the  z  axis. 

Given  s,  UBman  computes  the  cosines  of  the  an^es  0  and  ^  and  then  implicitly  uses 
V'l  -  co»^0  and  y/l  —  cos*^  to  get  their  sines.  (This  is  step  4  of  Ullman’s  solution.)  Ei¬ 
ther  sine  could  be  inaccurate,  however,  if  cos  9  or  cos^  is  very  close  to  one.  Fig.  7  shows  that 
when  this  happens  one  of  the  vectors  emanating  from  nio  is  nearly  parallel  to  the  image  plane. 
When  the  rotation  matrix  R  is  computed,  insiccuracy  in  the  sines  affects  the  elements  ri2,  ria, 
and  r23  (see  Equation  49).  Since  ri2  is  affected,  when  the  sdution  is  used  to  transform  an 
unmatched  model  pcant,  the  instability  can  propagate  to  pdnts  that  lie  in  the  plane  containing 
the  three  matched  model  points,  which  is  not  true  for  the  solution  in  this  piq>er  (Section  6). 

For  Huttenlocher  and  Ullman’s  method  in  Section  8.5,  catastrophic  cancellations  may  occur 
in  step  1  when  /n,  /is,  /21,  and  lj2  arc  computed,  in  step  4  when  kt  and  kj  are  computed,  and 
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in  step  8  when  sR  is  computed.  Instability  in  step  4  can  affect  Ci  and  C2  in  Equation  49:  If  Ci  is 
near  zero,  then  ki  and  kz  in  Equation  66  also  must  be  near  zero.  From  Equations  63  and  64,  ki 
and  kz  are  computed  with  additions,  and  so  cancellation  can  occur  if  they  are  small.  Similarly, 
if  C2  is  near  zero,  ki  must  be  small  as  well,  and  so  again  cancellation  can  occur.  From  Fig.  8,  Ci 
or  C2  is  nearly  zero  when  one  of  the  vectors  emanating  from  the  origin  (tuq)  is  nearly  parallel 
to  the  image  plane. 

When  111,  hit  hit  hi  computed  (Equations  58-61),  the  results  will  be  inaccurate  if 
the  determinant,  xi'ffz  -  ^iVit  is  close  to  zero,  which  happens  exactly  when  the  model  points  are 
almost  coUinear.  In  addition,  if  any  of  /n,  hit  hit  or  hi  is  almost  zero,  then  canceUation  can 
occur  in  computing  it.  There  are  many  pairs  of  model  and  image  triples  that  can  make  one  or 
more  of  hit  hit  hit  hi  close  to  zero  (e.g.,  hz  %  0  whenever  Zi  %  Xi  and  Z2  %  Zzt  independent 
of  yi,  Vit  yit  and  j^).  Furthermore,  in  step  8,  the  additions  in  computing  srja  and  srz3  can 
also  contribute  to  instability  (see  Equation  68).  Note,  however,  that  the  image  triangle  being 
nearly  coUinear  does  not  necessarily  make  the  computation  unstable. 

In  Crimson  et  al.’s  solution,  instability  may  arise  in  step  5  if  cos^  is  almost  1,  in  step  6  if  t 
is  1  and  u)  is  close  to  and  in  step  7  if  cos  <f>  is  near  1  or  cos  $  is  near  1.  As  with  the  soluticm  in 
this  paper,  these  situations  occur  when  the  model  or  image  points  are  nearly  coUinear  or  when 
one  of  the  sides  of  the  model  triangle  that  emanates  from  nio  is  nearly  paraUel  to  the  image 
plane.  Like  UUman’s  method  and  Huttenlocher  and  UUman’s  methods,  however,  instabiUty  can 
propagate  to  points  inside  the  plane  of  the  matched  model  points  (in  step  7). 

In  summary,  each  of  the  three  previous  sdiutions  spreads  instabiUty  in  the  pose  solution 
to  points  in  the  plane  of  the  three  matched  model  points;  however,  the  solution  in  this  paper 
does  not.  Furthermore,  the  situations  in  which  instability  can  arise  are  the  same  for  UUman's 
method,  the  method  of  Crimson  et  al.,  and  the  solution  in  this  paper.  SpeciiicaUy,  these 
situations  are  when  one  of  the  vectors  from  fho  is  paraUel  to  the  image,  when  the  model  points 
are  nearly  coUinear,  and  when  the  image  points  are  nearly  coUinear.  Huttenlocher  and  UUman's 
method  is  unstable  in  the  first  two  situations  as  weU,  which  is  expected  since  in  these  situations 
the  problem  is  iU  conditioned  (Section  6).  In  addition,  Huttenlocher  and  UUman’s  method  can 
be  unstable  in  many  cases  where  the  other  methods  are  not,  but  iray  be  more  stable  in  the 
case  that  the  image  points  are  nearly  coUinear. 


10  Conclusion 

The  weak-perspective  three-point  problem  is  fundamental  to  many  approaches  to  model-based 
recognition.  In  this  paper,  I  iUustrated  the  underlying  geometry,  and  then  used  it  to  derive 
a  new  solution  to  the  problem  and  to  explain  the  various  special  cases  that  can  arise.  In 
particular,  the  times  when  there  are  zero,  one,  and  two  solutions  are  described  graphicaUy. 

The  new  solution  is  based  on  the  cUstances  between  the  matched  model  and  image  points 
and  is  used  to  recover  the  three-space  locations  of  the  model  points  in  image  coordinates.  From 
the  recovered  locations,  a  direct  expression  for  the  image  location  of  a  fourth  model  point  is 


33 


obtained.  In  contreist,  earlier  solutions  computed  an  initial  transformation  that  brought  the 
model  into  image  coordinates,  and  then  computed  an  additional  transformation  to  align  the 
matched  model  points  to  their  corresponding  image  points.  As  a  result,  the  solution  given  here 
should  be  easier  to  use,  and,  for  recognition  systems  that  repeat  the  computation  of  the  model 
pose  many  times,  should  be  more  efficient. 

Another  difference  with  the  method  presented  here  is  that  it  makes  evident  the  symmetry 
of  the  solution  with  respect  to  the  ordering  of  the  model  and  image  points.  Previous  methods 
that  are  based  on  the  coordinates  of  the  points  after  some  initial  transformations  make  this 
symmetry  unclear. 

Furthermore,  this  paper  provides  stability  analyses  for  both  the  new  and  past  solutions,  none 
of  which  had  been  analyzed  for  stability  previously.  Each  conq>utation  is  examined  for  places 
where  precision  may  be  lost.  From  these  places,  the  geometries  that  give  rise  to  instability  are 
inferred.  These  geometries  are  used  to  distinguish  instabilities  that  arise  in  situations  where 
the  problem  is  ill  conditioned,  that  is,  situations  where  instability  is  inherent,  from  ones  that 
are  due  to  the  particular  computation. 

In  9ving  another  solution,  this  paper  revisits  UUman’s  original  biquadratic  equation  for  the 
scale  factor,  but,  in  addition,  goes  on  to  interpret  both  solutions  to  the  equation,  and  to  prove 
which  one  is  correct.  The  false  solution  is  shown  to  correspond  to  inverting  the  roles  of  the 
model  and  inoage  points. 

Lastly,  the  new  solution  is  accompanied  by  a  proof  that  the  solution  exists  and  is  unique. 
Of  the  previous  methods,  only  Huttenlocher  and  UUman’s  demonstrates  this  as  well,  and  was 
the  first  to  do  so.  Such  proofs  may  be  useful  for  gaining  insights  into  related  problems  as  well 
as  the  problem  itself.  Even  so,  since  existence  and  uniqueness  have  been  established,  all  the 
solutions  are  valid,  and  should  all  be  considered  when  a  related  problem  needs  to  be  solved. 
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A  Rigid  Transform  between  3  Corresponding  3D  Points 

This  appendix  computes  a  rigid  transform  between  two  sets  of  three  corresponding  points  using 
right-handed  coordinate  systems  built  separately  on  each  set  of  three  points.  A  right-handed 
system  is  determined  by  an  origin  point,  o,  and  three  perpendicular  unit  vectors,  fu,  r,  to). 
Given  three  points  in  space,  ft,  ft,  ft,  we  can  construct  a  right-handed  system  as  follows:  Let 
fti  =  ft  -  ft  and  fta  =  ft  -  ft.  Then  let 

d  =  ft 
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“  = 

^  =  P02  -  (ftj  •  )ftl 
to  =  ux  V 

Let  (^;ui,vi,ti;i)  and  (^;uj, 02,102)  be  the  coordinate  systems  so  defined  for  the  original  and 
camera-centered  points,  respectively. 

Given  a  coordinate  system  (o;  2,0,  to),  a  rigid  transformation  that  takes  a  point  in  world 
coordinates  to  a  point  in  that  coordinate  system  is  given  by  (R.,  f),  where 

R  =  [u  o  25],  t  =  o 

(see  for  example  [6]);  the  transformed  pis  Kp+t.  Then  we  can  bring  a  point  p  from  the  original 
system  to  the  world  and  then  to  the  camera-centered  system  using 

^■2  +t2  =  RaRi^p-l-  <2  ~ 

where 

Ri  =  [ui  oi  toi],  h  =  di 

Ra  =  [22  02  t02],  ti  =  02. 

Consequently  a  rigid  transformation  (R,^  that  aligns  the  two  coordinate  systems  is 

R  =  RaRj^,  t  =  —  RaRi^ti.  (84) 

B  Biquadratic  for  the  Scale  Factor 

This  appendix  shows 

4(s*Rqi  —  doi)(s^i2o2  —  do2)  =  (<*(-Ri2  ~  -^oi  "  •R02)  “  (^7  “  *^1  ~  <^2))  (85) 

is  equivalent  to  a  biquadratic  in  s. 

Expanding  Equation  85, 

4  ('S'*-Ao1'^2  “  +  ^2<^i)  +  <^1*^2)  = 

+  -^02  ~  -^12)*  “  2s*(JZqi  -I-  Ro2  ~  ■Si2)(<loi  +  <^2  “  <^12) 

+(<^1  +  <^2  ~  <^12)* 
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(^^1-^02  ~  (^01  +  ^02  ~ 

—25*  +  2^2*^oi  ~  (-^i  -^02  ~  -®i2)(<^oi  *^2  *^12)) 

+  (4doi‘^2  ~  (<^1  +  ^2  “  <^12)^)  =  0 

05^  -  265*  +  c  =  0, 
where 

a  =  4AgjAQ2  ~  (-^01  -^02  ~  -^12) 

b  =  2B^idQ2  +  2i2o2*^i  ~  (^1  -^02  “  ^i2)(‘^i  ■*"  *^02  ~  ‘^12) 

c  =  i4id^,-i4x+42-dl2f. 

C  Two  Solutions  for  Scale 

This  appendix  proves  Proposition  1.  The  proof  uses  the  following  lemma: 

Lemma  :  Let  /  be  either  (^)  or  (fe)  •  •">'» 

af  -2bf  +  c<0.  (86) 

Proof: 

a/*  -  2b f  +  c 

=  4(i?oijRo2  *in^)*/*  “ 

2  (2(iloi<iJj  +  Aojdoi  -  2iloii2o2<ioido2  cos  ^  cos  V»))  /  + 

4(doidoasin^)*,  from  Equations  24,  25,  and  26 

=  4(ii*2i2j2(l-CO.*^)/*- 

(■^01*^  "b  ^2*^1  ■"  2Jloi-Roadoi<ioa  cos  ^  cos  ^)/  + 

<^i<^2(l- cos*^))  (87) 

Suppose  that  /  =  •  Then  87  becomes 

4  (-  cos*  ^  cos  ^cos  ^  -  doido2  cos* 

=  -4^2‘^i  ^  ~ 

\  JCoi  Mt02  / 
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Suppose  instead  that  /  =  .  Then  87  becomes 

4  (-  eos=  +  2^S!^  CO,  ^  CO,  ^  -  4,4,  co,^  V. 

\  -^02  ^02 

\  it02  ^01  / 

Either  way,  ap  -  2b f  +  c  <  0. 

□ 

Proposition  1:  Let 


- 


oc 


S2  = 


lb  +  \/62  _  ac 


Then 


<^31  tio2  ^ 

«!  <  -5-4 <«2- 
Jtoi  itoj 


Proof:  Starting  from  the  result  of  the  lemma. 


ap  -  26/  +  c  <  0 


i  ((a/  -  6)*  —  (6*  -  oc))  <  0 

(0/  —  6)^  <  b^  —  ac,  since  a  >  0 


\af  —  6|  <  y/i^  —  ac 


-(0/ -  6)  <  Vbi^  -  oc  and  af  —  b<  y/bl^  -  ac 


f> 


b  —  V6*  —  oc 


and  /  < 


6+  ■v/6^  —  oc 


doi  rfo3  ^ 

-5-  ^ 

i£oi  noa 


□ 
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D  One  Solution  for  Scale 


In  the  “one  solution”  case,  we  wish  to  know  when  and  if  6^  —  ac  =  0  holds.  Using  the  result  of 
Appendix  this  means  that 

4(^01^02)^  (t*  -  2cos(<^  +  il})t  +  ij  —  2cos(^  -  ip)t  +  1^  =  0. 

For  this  to  hold,  either 

—  2  cos{4>  +  ^)t  +  1  =  0  or  —  2  cos(<^  -  i>)t  +  1  =  0. 

Solving  for  t  gives 

t  =  coa{4>  +  ^)  ±  I  sin(^  +  V')  or  t  =  cos(4>  -  0)  ±  » sin(^  -  r/>),  (88) 

where  i  =  Consequently,  there  are  real  values  of  t  that  make  -  ac  =  0  only  if 

sin(^  +  0)  =  0  or  sin(^  -  ^)  =  0.  These  situations  occur  when  <f>  —  ±‘tt>  and  ^  =  ±yj)  +  t. 
Substituting  into  Equation  88  gives  that  IP  —  ac  =  0  iff  both  (f>  =  or  <j>  ~  ±V>  +  *■  and  t  =  1, 
where  t  =  1  is  the  same  as  ^ 


E  No  Solutions  for  Scale 

Tins  appendix  shows  that  there  always  exists  a  solution  to  the  biquadratic  by  showing  that 
4^  -  oc  >  0.  From  Appendix  F, 

-  ac  =  (<*  -  2  cp;f 4>  +  ^)t  +  l)  (t*  -  2  cos((4  -  rlf)t  +  1^ 

>  4(i2oi<io2)^  -  2t  +  1)  (t’  -  2t  +  1) 

=  MRcido7)*{t-l)* 

>  0 


F  Simplifying  -  ac 

In  this  appendix,  I  derive  that 

6*  -  oc  =  4(iZoido2)^  -  2co8(^  +  V')t  +  l)  (t*  -  2cos(^  -  ^)t  +  l)  ,  (89) 


where 


Rpidpi 

Roido2 
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From  Equations  24,  25,  and  26, 


a  =  4(^oijRo2  sin^)* 

b  =  2(i2oi^02  "I"  ^02^01  -  2i2oi^2doi<^2  cos  (t>  cos  V’) 

c  =  4(doido2sinV’)* 

Then 

6’  =  -  4-R22<^iiZoido2  cos  ^  cos  V'  +  252i^2d§id22  + 

4iioiito2^i^2  cos^  <f>cos^  yj}  —  4J2Q2d^2'^2doi  cos  4>  cos  V’  +  ^1*^2) 

ac  =  16^x-^02^oi^o2  ^  V’ 


-ac  =  4  (/Zo2<^i  -  4i?o2d2i-Roido2  cos  cos  V>+ 

(2  +  4  cos*  (ft  cos*  V*  -  4  sin*  ^  sin*  ^)R^\Ro2d^xd^  — 

4-^01  ^2 -^2^1  cos  <f>cos  ^  +  ^1^22) 

=  4(^ieio2)^  -  4cos^cos  V’t®  +  (2  +  4  cos*  ^  cos*  ^  -  4sin*flisin*  ^)t*- 

4  cos  0COS  V**  +  1)  >  where  t  — 

Ro\do2 

=  4(iZoido2)^  -  2  (cos(<^  +  ^)  +  cos(<^  -  ^))  t*+ 

(2  +  4  cos(^  +  cos(^  -  i>))  t*  -  2  (cos(^  +  ^)  +  cos(^  -  ri>))  f  +  1^ 

=  4(^idb2)^  -  2  cos(0  +  ^)t  +  1)  (t*  -  2  cos(^  -  V')*  +  l) 
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